Natural Induction and Conceptual Clustering: A Review of Applications
نویسندگان
چکیده
Natural induction and conceptual clustering are two methodologies pioneered by the GMU Machine Learning and Inference Laboratory for discovering conceptual relationships in data, and presenting them in the forms easy for people to interpret and understand. The first methodology is for supervised learning (learning from examples) and the second for unsupervised learning (clustering). Examples of their application to a wide range of practical domains are presented, including bioinformatics, medicine, agriculture, volcanology, demographics, intrusion detection and computer user modeling, manufacturing, civil engineering, optimization of functions of very large number of variables (100-1000), design of complex engineering systems, tax fraud detection, and musicology. Most of the results were obtained by applying our recent natural induction program, AQ21, which is downloadable from http://www.mli.gmu.edu/msoftware.html. To give the Reader a quick insight into differences between natural induction implemented in AQ21 and some well-known learning methods, such as those implemented in C4.5, RIPPER, and CN2, as well as between conceptual clustering and conventional clustering, Sections 15 and 16 describe results from applying all these methods to very simple, designed problems.
منابع مشابه
Methodology of conceptual review in the health system
Background: Conceptual review is a creative research method for generating new knowledge in the context of a vague and complex concept that helps to explain and clarify the concept, its components and its relation to related concepts. This study aimed to explain the methodology of conceptual review in the health system. Methods: Articles related to the conceptual research method were searched ...
متن کاملA Joint Semantic Vector Representation Model for Text Clustering and Classification
Text clustering and classification are two main tasks of text mining. Feature selection plays the key role in the quality of the clustering and classification results. Although word-based features such as term frequency-inverse document frequency (TF-IDF) vectors have been widely used in different applications, their shortcoming in capturing semantic concepts of text motivated researches to use...
متن کاملIs Induction of Anomalies in Lymphocytes of the Residents of High Background Radiation Areas Associated with Increased Cancer Risk?
Man has been exposed to different levels of natural background radiation since the creation of human life. There are inhabited areas around the world with extraordinary levels of natural background radiation. The level of natural radiation in these areas is up to two orders of magnitude higher than other places. Areas such as Yangjiang, China; Guarapari, Brazil; and Kerala, India are among the ...
متن کاملAcquisition of Concept Descriptions by Conceptual Clustering
Case-based object recognition requires a general case of the object that should be detected. Real world applications such as the recognition of biological objects in images cannot be solved by one general case. A case-base is necessary to handle the great natural variations in the appearance of these objects. In this paper we will present how to learn a hierarchical case base of general cases. ...
متن کاملA Corpus-based Conceptual Clustering Method for Verb Frames and Ontology Acquisition
We describe in this paper the ML system, ASIUM, which learns subcategorization frames of verbs and ontologies from syntactic parsing of technical texts in natural language. The restrictions of selection in the subcategorization frames are filled by the concepts of the ontology. Applications requiring subcategorization frames and ontologies are crucial and numerous. The most direct applications ...
متن کامل